[NV RTX EP] Set Compute Capability only on Turing architecture#25446
Merged
jywu-msft merged 1 commit intomicrosoft:mainfrom Jul 19, 2025
Merged
[NV RTX EP] Set Compute Capability only on Turing architecture#25446jywu-msft merged 1 commit intomicrosoft:mainfrom
jywu-msft merged 1 commit intomicrosoft:mainfrom
Conversation
Contributor
|
Did we ask TRT-RTX guys why setting "current" profile causes perf regression ? |
Contributor
Author
Did not get a response yet on the root cause for this. For current state of TRT RTX this was the change suggested. |
Contributor
|
/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows x64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline |
|
Azure Pipelines successfully started running 5 pipeline(s). |
Contributor
Yes, change should be in the TRT RTX. But this WAR is fine for now. |
Contributor
|
ok. the WAR makes sense then. Thanks. |
jywu-msft
approved these changes
Jul 19, 2025
qti-yuduo
pushed a commit
to CodeLinaro/onnxruntime
that referenced
this pull request
Aug 8, 2025
…soft#25446) ### Description <!-- Describe your changes. --> Set compute capability only on Turing arch ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Setting the native compute capability was causing a regression in performance. @gaugarg-nv @ishwar-raut1 @ankan-ban
sanketkaleoss
pushed a commit
to sanketkaleoss/onnxruntime
that referenced
this pull request
Aug 11, 2025
…soft#25446) ### Description <!-- Describe your changes. --> Set compute capability only on Turing arch ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Setting the native compute capability was causing a regression in performance. @gaugarg-nv @ishwar-raut1 @ankan-ban
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
Set compute capability only on Turing arch
Motivation and Context
Setting the native compute capability was causing a regression in performance.
@gaugarg-nv @ishwar-raut1 @ankan-ban